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Abstract 

We measure the B° lifetime Tgo and the B°-B° oscillation frequency Am^ with a sample of ap- 
proximately 14,000 exclusively reconstructed B° — > D*~£ + vi signal events, selected from 23 million 
BB pairs recorded at the Y(4«S) resonance with the BABAR detector at the Stanford Linear Accel- 
erator Center. The 6-quark flavor of the other B at the time of decay and its decay position are 
determined inclusively. The lifetime and oscillation frequency are measured simultaneously with 
an unbinned maximum-likelihood fit that uses, for each event, the measured difference in B decay 
times (At), the calculated uncertainty on At, the signal and background probabilities, and 6-quark 
tagging information for the other B. The preliminary results are 

t b o = (1.523i£$| ± 0.022) ps 

and 

Am d = (0.492 ± 0.018 ± 0.013) ps" 1 . 
The statistical correlation coefficient between r^o and Am^ is —0.22. 
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1 Introduction and analysis overview 



The time evolution of B° mesons is governed by the overall decay rate 1/rgo and the B°-B° 
oscillation frequency Am^. The phenomenon of particle- antiparticle oscillations or "mixing" has 
been observed in neutral mesons containing a down quark and a strange quark {K mesons) or 
a bottom quark {B mesons) [|]J. In the Standard Model of particle physics, B mixing is the 
result of second-order charged weak interactions involving box diagrams containing virtual quarks 
with charge 2/3. In B mixing, the diagram containing the top quark dominates. Therefore, 
the mixing frequency Am^ is sensitive to the Cabibbo-Kobayashi-Maskawa quark-mixing matrix 
element Vtd @- In the neutral K meson system, mixing also has contributions from real intermediate 
states accessible to both a K° and a K° meson. These contributions are expected to be small for 
B mixing and are assumed to be negligible in this analysis. 

We present a measurement of the B° lifetime r^o and the oscillation frequency Am^ based on 
a sample of «14,000 exclusively reconstructed B° — > D*~i + V£ decaysf] selected from a sample of 23 
million BB events recorded at the T(AS) resonance with the BABAR detector at the Stanford Linear 
Accelerator Center, in 1999-2000. In this experiment, 9 GeV electrons and 3.1 GeV positrons, 
circulating in the PEP-II storage ring, annihilate to produce BB pairs moving along the e~ beam 
direction {z axis) with a known Lorentz boost of /?7 = 0.55, which allows a measurement of the 
time between the two B decays, At. 

The proper decay-time difference At between two neutral B mesons produced in a coherent 
P-wave state in an Y(4f>) event is governed by the following probabilities to observe an unmixed 
event, 

P(B°B° -> B°B°) oc e - |At|/T B°(l + cosAm d At), (1) 

or a mixed event, 

P{B°B° -> B°B° or B°B°) oc e H A *l/ r B° (1 - cos Am d At). (2) 

Therefore, if we measure At and identify the 6-quark flavor of both B mesons at their time of 
decay, we can extract the B° lifetime r^o and the mixing frequency Arrid- In this analysis, one 
B is reconstructed in the mode B° —* D*~£ + U£, which has a measured branching fraction of 
(4.60 ± 0.21)% ||]. Although the neutrino cannot be detected, the requirement of a reconstructed 
D*~ — > D°ir~ decay and a high-momentum lepton satisfying kinematic constraints consistent with 
the decay B° — > D*~l + V£ allows the isolation of a signal sample with (65 - 89)% purity, depending 
on the D° decay mode and whether the lepton candidate is an electron or a muon. The charges 
of the final-state particles identify the meson as a B° or a B°. The remaining charged particles in 
the event, which originate from the other B (referred to as -Btag)> are used to identify, or "tag", 
its flavor as a B or a B°. The time difference At = tr>*t — ttag ~ Az/ft'yc is determined from the 
separation Az of the decay vertices for the D*~l + candidate and the tagging B along the boost 
direction. The average separation is 250 fim. 

The oscillation frequency Am^ and the average lifetime of the neutral B meson, t b o , are deter- 
mined simultaneously with an unbinned maximum-likelihood fit to the measured At distributions 
of events that are classified as mixed and unmixed. This is in contrast to published measurements 
in which only Am^ is measured with r^o fixed to the world average, or only r^o is measured. There 
are several reasons to measure the lifetime and oscillation frequency simultaneously. The statistical 

1 Throughout this paper, charge conjugate modes are always implied. 
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precision for both t b o and Ara^ is comparable to the uncertainty on the world average. Therefore, 
it is appropriate to measure both quantities rather than fixing the lifetime to the world average. 
Since mixed and unmixed events have different At distributions, the mixing information for each 
event gives greater sensitivity to the At resolution function and a smaller statistical uncertainty 
on Tgo. Also, since B + B~ events do not mix, we can use the At distributions for mixed and 
unmixed events to help discriminate between B°B° signal events and B + B~ background events in 
the lifetime and mixing measurement. 

There are three main experimental complications that affect the At distributions given in Eqs.|l| 
and ||. First, the tagging algorithm, which classifies events into categories c depending on the source 
of the available tagging information, incorrectly identifies the flavor of -Btag with a probability 
w c with a consequent reduction of the observed amplitude for the mixing oscillation by a factor 
(1 — 2w c ). Second, the resolution for At is comparable to the lifetime and must be well understood. 
The probability density functions (PDF's) for the unmixed (+) and mixed (— ) signal events can 
be expressed as the convolution of the underlying At trU e distribution for tagging category c, 

g— |Attrue]/r B o 

[1 ± (1 - 2w c ) COS Am^Attrue], 

At b o 

with a resolution function TZ(At meas — At true ;q c ) that depends on a set of parameters q c . A final 
complication is that the sample of selected B — > D*~£ + V£ candidates is not pure signal. 

To characterize the backgrounds, we select control samples of events enhanced in each type of 
background and determine the signal and background probabilities for each event in the signal and 
background control samples as described in Sec. f|. The measurement of Az and the determination 
of At and aAt for each event is discussed in Sec. |f| The 6-quark tagging algorithm is described in 
bee. g In Sec. 0, we describe the unbinned maximum-likelihood fit. The physics model and At 
resolution function used to describe the measured At distribution for signal are given in Sec. || A 
combination of Monte Carlo simulation and data samples are used to determine the parameteri- 
zation of the PDF's to describe the At distribution for each type of background, as described in 
Sec. [| The likelihood is maximized in a simultaneous fit to the signal and background control 
samples to extract the B° lifetime t b o, the mixing frequency Am^, the mistag probabilities w c , 
the signal At resolution parameters q c , the background At model parameters, and the fraction 



of B + — > D* l + i>iX decays in the signal sample. The results of the fit are given in Sec. 10 



Cross-checks are described in Sec. 11 and systematic uncertainties are summarized in Sec. 



2 The BAR4R detector 

The BABAR detector is described in detail elsewhere The momenta of charged particles are 
measured with a combination of a 40-layer drift chamber (DCH) and a five-layer silicon vertex 
tracker (SVT) in a 1.5-T solenoidal magnetic field. A detector of internally-reflected Cherenkov 
radiation (DIRC) is used for charged hadron identification. Kaons are identified with a neural 
network based on the likelihood ratios calculated from dE/dx measurements in the SVT and 
DCH, and from the observed pattern of Cherenkov light in the DIRC. A finely-segmented CsI(Tl) 
electromagnetic calorimeter (EMC) is used to detect photons and neutral hadrons, and to identify 
electrons. Electron candidates are required to have a ratio of EMC energy to track momentum, 
an EMC cluster shape, DCH dE/dx, and DIRC Cherenkov angle all consistent with the electron 
hypothesis. The instrumented flux return (IFR) contains resistive plate chambers for muon and 
long-lived neutral hadron identification. Muon candidates are required to have IFR hits located 
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along the extrapolated DCH track, an IFR penetration length, and an energy deposit in the EMC 
consistent with the muon hypothesis. 

3 Data samples 

The data used in this analysis were recorded with the BABAR detector Q at the PEP-II storage 
ring || in the period October 1999 to December 2000. The total integrated luminosity of the data 
set is 20.6 fb" 1 collected at the T(45) resonance and 2.6 ftr 1 collected about 40 MeV below the 
T(4if?) (off-resonance data). The corresponding number of produced BB pairs is 23 million. 

Samples of Monte-Carlo simulated BB and cc events, generated with a GEANT3 [|| detector 
simulation, are analyzed through the same analysis chain as the real data to check for biases in the 
extracted physics parameters and are also used to develop models for describing detector resolution 
effects. The values of the parameters used in these models are determined with real data. The 
equivalent luminosity of this simulated data is approximately equal to that of the real data for BB 
events and about half that of real data for cc events. In addition, we generate signal Monte Carlo 
samples in which one neutral B meson in every event decays to D*~£ + U£, with D*~ —* D°ir~ , and 
the other neutral B meson decays generically. The D° then decays to one of the four final states 
reconstructed in this analysis (described in the next section). The equivalent luminosity of the 
simulated signal samples is equal to approximately 2 to 8 times that of the real data, depending 
on the D° decay mode. 

4 Event selection and characterization 

We select events containing a fully-reconstructed D*~ and an identified oppositely-charged electron 
or muon. This D*~£ + pair is then required to pass kinematic cuts that enhance the contribution 
of semileptonic B° — » D*~£ + V£ decays. In addition to the signal sample, we select several control 
samples that are used to characterize the main sources of background. 

We define the following classification of the sources of signal and background that we expect to 
contribute to this sample. The nomenclature shown in italics will be used throughout this paper 
to define signal and all possible types of background. 

1. Events with a correctly reconstructed D*~ candidate: 

(a) Events that originate from BB events: 

i. Events with a correctly identified lepton candidate: 

A. Signal- B° -> D*-£ + u t (X) decays. 

B. Uncorrelated-lepton background -(£>—> D*~X, other B —* £ + X) or (B — > 
D*-X, X -> £+Y) 

C. Charged B background - B + — > D*~£ + v e X. 

ii. Fake-lepton background - events with a misidentified lepton candidate. 

(b) Continuum background - cc — > D*~X. 

2. Combinatoric background - events with a misreconstructed D*~ candidate. 

Lepton candidates are defined as charged tracks with momentum in the T(45*) rest frame 
greater than 1.2 GeV. For the D*~e + samples, the electron candidate passes selection criteria 
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with a corresponding electron identification efficiency of about 90% and hadron misidentification 
less than 0.2%. For the D*~fi + samples, the muon candidate passes selection criteria with a 
corresponding muon identification efficiency of about 70% and hadron misidentification between 
2% and 3%. For the fake-lepton control sample, D*~l + candidates are accepted if the lepton fails 
both electron and muon selection criteria looser than those required for lepton candidates. 

D* candidates are selected in the decay mode D*~ — > D°ir~ . The D° candidate is reconstructed 
in the modes K~tt + , K~ir + -K~ir + , K~Tr + ir° and Kgir~ir + . The daughters of the D° decay are 
selected according to the following definitions. ir° candidates are reconstructed from two photons 
with invariant mass within 15.75 MeV of the ir° mass. The mass of the photon pair is constrained 
to the 7T° mass and the photon pair is kept as a ir° candidate if the x 2 probability of the fit is 
greater than 1%. Kg candidates are reconstructed from a pair of charged particles with invariant 
mass within 15 MeV of the K s mass. The pair of tracks is retained as a K s candidate if the x 2 
probability that the two tracks form a common vertex is greater than 1%. Charged kaon candidates 
satisfy loose kaon criteria for the K~ir + mode and tighter criteria for the K~ir + ir~ir + and K~ir + ir° 
modes. For the K~tt + tt° and K s ir~ir + modes, a likelihood is calculated as the square of the decay 
amplitude in the Dalitz plot for the three-body candidate, based on measured amplitudes and 
phases @. The candidate is retained if the likelihood is greater than 10% of its maximum value 
across the Dalitz plot. 

D° candidates have measured invariant mass within 17 MeV of the D° mass for the K~ir + , 
K~Tr + ir~Tr + , and K s tt^tt + modes, and within 34 MeV for the K~ir + Tr° mode. The invariant mass 
of the daughters is constrained to the D° mass and the tracks are constrained to a common vertex 
in a simultaneous fit. The D° candidate is retained if the x 2 °f the fit is greater than 0.1%. 

The low-momentum pion candidates for the D*~ — > D°n~ decay are selected with total mo- 
mentum less than 450 MeV in the T(4S) rest frame and momentum transverse to the beamline 
greater than 50 MeV. The momentum of the D* candidate in the Y(4«S) rest frame is between 0.5 
and 2.5 GeV. The D*~l + candidate satisfies | cos#j* hrust | < 0.85, where #* hrust is the angle between 
the thrust axis of the D*~£ + candidate and the thrust axis of the remaining charged and neutral 
particles in the event. D*~£ + candidates are retained if the \ 2 probability that the daughter tracks 
form a common vertex is greater than 1% and m(D*) — m(D°) is less than 165 MeV, where m(D*) 
is the candidate D°tt~ mass calculated with the candidate D° mass constrained to the true D° 
mass, m(D°). 

We define two angular quantities for each D*~£ + candidate to classify them into a sample 
enriched in B° — ► D*~t + i>£ signal events, in which the D* and lepton candidates are on opposite 
sides of the event, and a sample enriched in uncorrelated-lepton background events, in which the 
D* and lepton candidates are on the same side of the event. The first angle is Op* £, the angle 
between the D* and lepton candidates in the T(45) rest frame. The second angle is 9b,d*£, the 
angle between the direction of the B° and the vector sum of the D* and lepton candidate momenta, 
calculated in the T(4S*) rest frame. Since we do not know the direction of the B°, we calculate the 
cosine of 6b,d*i from the following equation, in which we assume that the only B decay particle 
missed in the reconstruction is a massless neutrino: 

cos 9 b ,dh = ^ — i • 3 

All quantities in Eq. ||| are defined in the T(4S I ) rest frame. The energy and the magnitude of 
the momentum of the B are calculated from the e + e~ center-of-mass energy and the B° mass. 
For true B° — > D*~l + V£ events, cos0B,D*e nes i n the physical region [— 1,+1], except for detector 
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resolution effects. Backgrounds lie inside and outside the range [— 1,+1]. We also calculate the 
same angle with the lepton momentum direction reflected through the origin in the T(4S) rest 
frame: 9 b ,d* (-<?)■ 

A sample enhanced in B° — ► D*~l + V£ signal events (called the opposite-side sample) is composed 
of D*~£ + candidates with cos#d*£ < and \ cos 9b,d*£\ < 1-1- Samples are defined for lepton 
candidates that satisfy the criteria for an electron, a muon and a fake-lepton. The first two samples 
are the signal samples, and the latter is the fake-lepton control sample. 

An additional background control sample, representative of the uncorrelated-lepton background 
and called the same-side sample, is composed of D*~l + candidates satisfying cos0£)*£ > and 
| cos#b,d*(-^)I < 1.1. We use cos9 BD *^_^ rather than cos 9 b,d*£ because, in Monte Carlo simula- 
tion, the distribution of cos 9 B D *^_^ in this control sample is similar to the distribution of cos 9b,d*£ 
for uncorrelated-lepton background in the signal sample, whereas the distribution of cos 9 b,d*i in 
the background control sample is systematically different. 

D*~l + candidates are retained if a fit of the lepton, ir~ , and D° candidates to a common vertex 
converges. In addition, several criteria that depend on the charged tracks in the rest of the event, 
as well as the D*~£ + candidate, are applied. Events are retained if at least two tracks are used to 
determine the decay point of the other B, the fit that determines the distance Az between the two 
B decays along the beamline converges, the time between decays (At) calculated from Az is less 
than 18 ps, and the calculated error on At (a At) is less than 1.8 ps. 

Approximately 68, 000 candidates pass the above selection criteria. These candidates are dis- 
tributed over two signal samples and ten background control samples defined by the following char- 
acteristics: whether the data was recorded on or off the T(45) resonance (two choices); whether 
the candidate lepton is same-side or opposite-side to the D* candidate (two choices); and whether 
the lepton candidate passes the criteria for an electron, a muon, or a fake lepton (three choices). 

The combinatoric background due to events with a misreconstructed D* candidate can be 
distinguished from events with a real D* in a plot of the mass difference m(D*) — m(D°). The 
m(D*) — m(D°) distributions for the samples of signal events (opposite-side D*~e + and D*~fi + 
candidates in on-resonance data) are shown as data points in Fig. |l] for electron candidates (left) 
and muon candidates (right). 

Each of the 12 samples described above are further divided into 30 subsamples according to the 
following characteristics that may affect the m(D*) — m(D°) or At distributions. 

1. The ir~ from the D* decay reconstructed in the SVT only, or in the SVT and DCH (two 
choices): The m(D*) — m(D°) resolution is worse when the ir~ is reconstructed only in the 
SVT. 

2. The D° candidate reconstructed in the mode K~ir + or /T _ 7r + 7r° or (K~it + tt~tt + or K^it~it + ) 
(three choices): The level of contamination from combinatoric background and the m(D*) — 
m(D°) resolution may depend on the D° decay mode. 

3. The 6-tagging information used for the other B (five choices; see Sec. ^): The level of con- 
tamination from each type of background and the At resolution parameters may depend on 
the tagging information. 

This allows subdivision into 360 samples. In the unbinned maximum likelihood fits to the m(D*) — 
m(D°) and (At, a At) distributions, individual fit parameters are shared among different sets of 
subsamples based on physics motivation and observations in the data. 
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Figure 1: m(D*) — m(D°) distribution for events passing all selection criteria for B° — > D*~£ + V£, 
with an electron (left) or muon (right) candidate. The points correspond to the data. The curve 
is the result of a simultaneous unbinned maximum likelihood fit to this sample of events and a 
number of background control samples. The shaded distributions correspond to the four types of 
background described in the text. 



We do a simultaneous fit to the m(D*) — m(D°) distributions for all 360 subsamples. The 
peak due to real D* candidates is modeled by the sum of two Gaussian distributions; the mean 
and variance of both the Gaussian distributions, as well as the relative normalization of the two 
Gaussians, are free parameters in the fit. We model the shape of the combinatoric background with 
the function 

1 [ / 5m — 5mo\ 
N i 1 " 6XP { ~ J 

where 5m = m(D*) — m(D°), N is a normalization constant, 5m$ is a kinematic threshold equal to 
the mass of the ir~ , and c\ and C2 are free parameters in the fit. An initial fit is done to determine 
the shape parameters describing the peak and combinatoric background. Separate values of the 
five parameters describing the shape of the peak are used for the six subsamples defined by whether 
the tt~ candidate is tracked in the SVT only or in the SVT and DCH, and the three types of D° 
decay modes. Each of these six groups that share peak parameters is further subdivided into twelve 
subgroups that each share a common set of the two combinatoric background shape parameters. 
Ten of these twelve subgroups are defined by the five tagging categories for the large signal samples 
and for the fake-lepton control samples, in on-resonance data. The other two subgroups are defined 
as same-side, on-resonance samples and all off-resonance samples. 

Once the peak and combinatoric background shape parameters have been determined, we fix 
the shape parameters and determine the peak and combinatoric background yields in each of the 
360 subsamples with an unbinned extended maximum-likelihood fit. The total peak yields in the 
signal sample and each background control sample are then used to determine the amount of true 
signal and each type of peaking background in the m{D*) — m(D°) peak of each sample as follows. 

1. Fake-lepton background - Particle identification and misidentification efficiencies for the 
electron, muon, and fake-lepton selection criteria are measured in data as a function of labo- 



/ 5m \ 
\5mo) 
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ratory momentum, polar angle, and azimuthal angle, for true electrons, muons, pions, kaons, 
and protons. B°B° and B + B~ Monte Carlo simulations are used to determine the measured 
laboratory momentum, polar angle, and azimuthal angle distributions for true electrons, 
muons, pions, kaons and protons that pass all selection criteria for D*~£ + candidates, except 
the lepton (or fake-lepton) identification criteria. These distributions are combined with the 
measured particle identification efficiencies and misidentification probabilities to determine 
the momentum- and angle-weighted probabilities for a true lepton or true hadron to pass the 
criteria for a lepton or a fake lepton in each of the D*~£ + signal and background control 
samples. We then use these efficiencies and misidentification probabilities, and the observed 
number of lepton and fake-lepton candidates in data, to determine the number of true leptons 
and fake leptons (hadrons) in each control sample. 

2. Uncorrelated-lepton background - To determine the number of uncorrelated-lepton events 
in each sample, we use the relative efficiencies from Monte Carlo simulation for signal and 
uncorrelated-lepton events to pass the criteria for same-side and opposite-side samples. 

3. Continuum background - We use the peak yields in off-resonance data, scaled by the relative 
integrated luminosity for on- and off-resonance data, to determine the continuum-background 
yields in on-resonance data. 

The peak yields and continuum, fake-lepton, and uncorrelated-lepton fractions of the peak 
yield, as well as the combinatoric fraction of all events in a m(D*) — m(D ) signal window, are 
shown in Table [l] for the signal and background control samples in on-resonance data. The peak 
yields include the peaking backgrounds. The signal window is defined as (143 - 148) MeV for the 
calculation of combinatoric background fractions. 

Table 1: Peak yields and continuum, fake-lepton, and uncorrelated-lepton fractions of the peak 
yield, and the combinatoric fraction of total events in a m(D*)—m(D°) signal window for the signal 
and background control samples in on-resonance data. Peak yields include the peaking backgrounds. 
The signal window for combinatoric background fractions is defined as (143 - 148) MeV. OS and 
SS refer to opposite-side and same-side samples; e, fi, and f indicate the type of lepton candidate: 
electron, muon or fake-lepton. 



Category 


Peak Yield 


/cont(%) 


/fake(%) 


/uncorr (%) 


/comb(%) 


OS 


e 


7008 ± 91 


1.53 ±0.42 


0.1678 ±0.0042 


3.14 ±0.39 


17.89 ± 0.24 






6569 ± 88 


2.27 ±0.57 


2.669 ± 0.067 


2.85 ± 0.48 


18.36 ±0.25 




f 


8770 ± 108 


12.8 ±1.3 


72.4 ± 1.8 


0.7 ± 1.6 


31.40 ±0.24 


SS 


e 


306 ± 21 


0.000 ± 0.006 


0.533 ± 0.039 


56.9 ± 7.0 


34.0 ± 1.3 






299 ± 20 


5.1 ±3.6 


8.89 ± 0.64 


48.9 ±8.0 


34.4 ± 1.3 




f 


1350 ± 45 


20.4 ±4.1 


74.4 ± 5.4 


3.6 ± 7.8 


42.59 ±0.61 



Finally, we use the calculated fractions and fitted shapes of the background sources in each 
control sample to estimate the probability of each candidate to be due to signal or each type of 
background (combinatoric, continuum, fake-lepton, or uncorrelated-lepton) when we fit the (At, 
o~ At) distribution to determine the lifetime and mixing parameters. We take advantage of the fact 
that charged and neutral B decays have different decay-time distributions (because the charged B 
does not mix) to determine the fraction of charged B background events in the fit to (At, a At)- 
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5 Decay-time measurement 



The decay-time difference At between B decays is determined from the measured separation Az = 
Z£)*i — ztag along the z axis between the D*~l + vertex position (zd*£) and the flavor-tagging decay 
-Btag vertex position (ztag)- This measured Az is converted into At with the use of the known 
T(4S) boost, determined for each run. Since we cannot reconstruct the direction of the B meson 
for each event, we use the excellent approximation At ~ Az/(f3 , yc). 

The momentum and position vectors of the D°, ir~ , and lepton candidates, and the average 
position of the e + e~ interaction point (called the beam spot) in the plane transverse to the beam are 
used in a constrained fit to determine the position of the D*~l + vertex. The beam-spot constraint 
is of order 100 /im in the horizontal direction and 30 fim in the vertical direction, corresponding to 
the RMS size of the beam in the horizontal direction and the approximate transverse flight path of 
the B in the vertical direction. The beam-spot constraint improves the resolution on zd*i by about 
30%. The RMS spread on the difference between the measured and true position of the D*~i + 
vertex is about 80 /im (0.5 ps). 

We determine the position of the -Btag vertex from all charged tracks in the event except the 
daughters of the D*~£ + candidate, using Kg and A candidates in place of their daughter tracks, 
and excluding tracks that are consistent with being due to photon conversions. The same beam- 
spot constraint applied to the I?d*£ vertex is also applied to the -Btag vertex. To reduce the 
influence of charm decay products, which bias the determination of the vertex position, tracks with 
a large contribution to the x 2 of the vertex fit are iteratively removed until those remaining have 
a reasonable fit probability or only one track remains. The RMS spread on the difference between 
the measured and true position of the -Btag vertex is about 160 fim (1.0 ps). Therefore, the At 
resolution is dominated by the z resolution of the tag vertex position. Events are retained if the 
fit converges, at least two tracks contribute to the determination of the tag vertex fit, the time 
between decays (At) calculated from Az is less than 18 ps, and the calculated error on At (oAt) is 
less than 1.8 ps. 

We calculate the uncertainty on Az due to uncertainties on the track parameters from the 
SVT and DCH hit resolution and multiple scattering, our knowledge of the beam-spot size, and 
the average B flight length in the vertical direction. The calculated uncertainty does not account 
for errors in pattern recognition in tracking, errors in associating tracks with the B vertex, or the 
effects of misalignment within and between the tracking devices. The calculated uncertainties will 
also be incorrect if our assumptions for the amount of material in the tracking detectors or the 
beam-spot size or position are inaccurate. We use parameters in the At resolution model, measured 
with data, to account for uncertainties and biases introduced by these effects. 

6 Flavor tagging 

All charged tracks in the event, except the daughter tracks of the D*~l + candidate, are used to 
determine whether the -Bt ag decayed as a B° or a B°. This is called flavor tagging. We use five 
different types of flavor tag, or tagging categories, in this analysis. The first two tagging categories 
rely on the presence of a prompt lepton, or one or more charged kaons, in the event. The other three 
categories exploit a variety of inputs with a neural-network algorithm. The tagging algorithms are 
described briefly in this section; see Ref. []|] for more details. 

Events are assigned a lepton tag if they contain an identified lepton with momentum in the 
T(4S) rest frame greater than 1.0 or 1.1 GeV for electrons and muons, respectively, thereby selecting 
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mostly primary leptons from the decay of the b quark. If the sum of charges of all identified kaons is 
nonzero, the event is assigned a kaon tag. The final three tagging categories are based on the output 
of a neural network that uses as inputs the momentum and charge of the track with the maximum 
center-of-mass momentum, the number of charged tracks with significant impact parameters with 
respect to the interaction point, and the outputs of three other neural networks, trained to identify 
primary leptons, kaons, and soft pions. Depending on the output of the main neural network, 
events are assigned to an NT1 (most certain), NT2, or NT3 (least certain) tagging category. About 
30% of events are in the NT3 category, which has a mistag rate close to 50%. Therefore, these 
events do not carry much sensitivity to the mixing frequency, but they increase the sensitivity to 
the B° lifetime. 

Tagging categories are mutually exclusive due to the hierarchical use of the tags. Events with 
a lepton tag and no conflicting kaon tag are assigned to the lepton category. If no lepton tag 
exists, but the event has a kaon tag, it is assigned to the kaon category. Otherwise events are 
assigned to corresponding neural network categories. 



7 Fit method 



We perform an unbinned fit simultaneously to events in each of the 12 signal and control samples 
(indexed by s) that are further subdivided into 30 subsamples (indexed by c) using a likelihood 
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30 N(s,c) 

nn n PsA^-,®, 



(5) 



s=l c=l k=l 



where k indexes the N(s,c) events Xj, in each of the 360 subsamples. The probability P s , c (^k ;p) 
of observing an event X]~ = (5m, At, a A t, g) is calculated as a function of the parameters p 

ijcomb -peak ^™ m K „ t 
iPs,c iPc 



/fcomb -comb -peak -, C omb fPkg f pkg f pkg -.pkg -.pkg -,pkg -.sigN 
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P StC (5m,At,a At ,g;p) = 

fsT h ■ ^ comb (5m;p^ b ) ■ g comb (At, a At , g ; £ c ° mb ) + (l - f s f>) ■ F^(5m;p, 



■ ^peak^j 



E fsS • StH^t, a At , g ; + 1 - ^ ) • G^(At, a At ,g; 

3=1 \ 3=1 



(6) 



where j indexes the three sources of peaking background and 5m is the mass difference m(D*) — 
m(D°) defined earlier. The index g is +1 (—1) for unmixed (mixed) events. By allowing different 
effective mistag rates for apparently mixed or unmixed events in the background functions g comh 
and Q pk& , we accomodate the different levels of backgrounds observed in mixed and unmixed 
samples. Functions labeled with T describe the probability of observing a particular value of 5m 
while functions labeled with Q give probabilities for values of At and a A t in category g. Parameters 
labeled with / describe the relative contributions of different types of events. Parameters labeled 
with p describe the shape of a 5m distribution, and those labeled with q describe a (At, a A t) shape. 

Note that we make explicit assumptions that the 5m peak shape, parameterized by ^P eak , and 
the signal (At, a At) shape, parameterized by (f c slg , depend only on the subsample index c. The 
first of these assumptions is supported by data, and simplifies the analysis of peaking background 
contributions. The second assumption reflects our expectation that the At distribution of signal 
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events does not depend on whether they are selected in the signal sample or appear as a background 
in a control sample. 

The ultimate aim of the fit is to obtain the B° lifetime and mixing frequency, which by construc- 
tion are common to all sets of signal parameters g,f g . Most of the statistical power for determining 
these parameters comes from the signal sample, although the fake and uncorrelated background 
control samples also contribute due to their signal content (see Table [l]) . 

We bootstrap the full fit with a sequence of initial fits using reduced likelihood functions to 
a partial set of samples, to determine the appropriate parameterization of the signal resolution 
function and the background At models, and to determine starting values for each parameter in 
the full fit. 

1. We first find a model that describes the At distribution for each type of event: signal, 
combinatoric background, and the three types of backgrounds that peak in the m(D*)—m(D°) 
distribution. To establish a model, we use Monte Carlo samples that have been selected to 
correspond to only one type of signal or background event based on Monte Carlo truth 
information. These samples are used to determine the At model and the categories of events 
(e.g., tagging category, fake or real lepton) that can share each of the parameters in the 
model. Any subset of parameters can be shared among any subset of the 360 subsamples. We 
choose parameterizations and sharing of parameters that minimizes the number of different 
parameters while still providing an adequate description of the At distributions. 

2. We then find the starting values for the background parameters by fitting to each of the 
background-enhanced control samples in data, using the model (and sharing of parameters) 
determined in the previous step. Since these background control samples are not pure, we start 
with the purest control sample (combinatoric background events from the m(D*) — m(D°) 
sideband) and move on to less pure control samples, always using the models established from 
earlier steps to describe the At distribution of the contamination from other backgrounds. 

The result of the above two steps is a At model for each type of event and a set of starting 
values for all parameters in the fit. When we do the final fit, we fit all signal and control samples 
simultaneously (~68k events), leaving all parameters free in the fit (72 free parameters). The 
physics parameters t b o and Am^ were kept hidden until all analysis details and the systematic 
errors were finalized, to eliminate experimenter's bias. However, statistical errors on the parameters 
and changes in the physics parameters due to changes in the analysis were not hidden. 



8 Signal At model 

For signal events in a given tagging category c, the probability density function (PDF) for At 
consists of a physics model convolved with a At resolution function: 

g si ^At,a At ,g;^) = J^ e H Alta »l/ T s°(l + $(1 _ 2b j c ) cos(Am rf At true ))l ® U(5At, a At ; q c ) , 

where 1Z is a resolution function, which can be different for different event categories, g is +1 (— 1) 
for unmixed (mixed) events, and <5At is the residual At — At trU e- The physics model shown in 
the above equation has seven parameters: Am^, t b o, and mistag fractions lo c for each of the five 
tagging categories. To account for an observed correlation between the mistag rate and o"At in the 
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kaon category (described in Sec. |8,1] ), we allow the mistag rate in the kaon category to vary as a 
linear function of aAt'- 

, offset 

Wkaon = "T-kaon " & At + <^kaon ■ {<) 

In addition, we allow the mistag fractions for B° tags and B° tags to be different. We define 
Aoj = LOgo — LUgo and uj = (u B o + lo^ )/2, so that 

U B°/B° = ^ ± ^Aw. 

There are 13 free parameters in the complete physics model for all tagging categories. 

For the At resolution model, we use the sum of a single Gaussian distribution and the same 
Gaussian convolved with a one-sided exponential to describe the core part of the resolution function, 
plus a single Gaussian distribution to describe the contribution of "outliers" — events in which the 
reconstruction error 5 At is not described by the calculated uncertainty a At'- 

n GExp+G (5At, a At ; s, k, f, 6° ut , s out , / out ) 

= / • G(5At; 0, s(j At ) + (1 - / - /° ut ) • G(u - 5At; 0, sa At ) ® E(u; kit At) (8) 
+ f ont -G(5At;b ont ,s out ) , 

where u is an integration variable in the convolution G®E. The functions G and E are defined by 

G(x; x , a) = * exp (-(x - x ) 2 / (2a) 2 ) 
\J2ira 

and 



\ f -exp(x/a) if x < 0, 
v ; \ if x > 0. 



Since the outlier contribution is not expected to be described by the calculated error on each 
event, the last Gaussian term in Eq. [8| does not depend on aAt- However, in the terms that describe 
the core of the resolution function (the first two terms on the right-hand side of Eq. ||) , the Gaussian 
width s and the effective decay constant k are scaled by aAt- The scale factor s is introduced to 
accommodate an overall underestimate (s > 1) or overestimate (s < 1) of the errors for all events. 
The decay constant k is introduced to account for residual charm decay products included in the 
-Btag vertex; k is scaled by aAt to account for a correlation observed in Monte Carlo simulation 
between the mean of the 5 At distribution and the measurement error a At- This correlation is due 
to the fact that, in B decays, the vertex error ellipse for the D decay products is oriented with 
its major axis along the D flight direction, leading to a correlation between the D flight direction 
and the calculated uncertainty on the vertex position in z for the -Btag candidate. In addition, the 
flight length of the D in the z direction is correlated with its flight direction. Therefore, the bias 
in the measured -Btag position due to including D decay products is correlated with the D flight 
direction. Taking into account these two correlations, we conclude that D mesons that have a flight 
direction perpendicular to the z axis in the laboratory frame will have the best z resolution and 
will introduce the least bias in a measurement of the z position of the -Btag vertex, while D mesons 
that travel forward in the laboratory will have poorer z resolution and will introduce a larger bias 
in the measurement of the B tag vertex. 

The mean and RMS spread of At residual distributions in Monte Carlo simulation vary signifi- 
cantly among tagging categories. We find that we can account for these differences by allowing the 
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core Gaussian fraction / to be different for each tagging category. In addition, we find that the 
correlations among the three parameters describing the outlier Gaussian (6 out , s out , / out ) are large 
and that the outlier parameters are highly correlated with other resolution parameters. Therefore, 
we fix the outlier bias 6 out and scale factor s out , and vary them over a wide range to evaluate the 
systematic uncertainty on the physics parameters due to fixing these parameters (see Sec. 12). The 



resolution model then has 8 free parameters: s, k, / out , and five fractions f c (one for each tagging 
category c). 

As a cross-check, we also use a resolution function that is the sum of a narrow and a wide 
Gaussian distribution, and a third Gaussian to describe outliers: 

K G+G+G (5At, a At ; b, s, /, b w , s w , 6 out , S out , / out ) 

= / • G(5At; ha*, ac M ) + (1 - / - f ut ) ■ G(5At; b w a At , s w a At ) + f° ut ■ G(SAt; b out ,s out ) . 



8.1 Vertex-tagging correlations 

A correlation of about 0.12 ps _1 is observed between the mistag rate and the At resolution for 
kaon tags. This effect is modeled in the resolution function for signal as a linear dependence of the 
mistag rate on cr A t, as shown in Eq. 0. In this section, we describe the source of this correlation. 

We find that both the mistag rate for kaon tags and the calculated error on At depend inversely 
on \jY,pl, where pt is the transverse momentum with respect to the z axis of tracks from the -Btag 
decay. Correcting for this dependence of the mistag rate removes most of the correlation between 
the mistag rate and a A t- The mistag rate dependence originates from the kinematics of the physics 
sources for wrong-charge kaons. The three major sources of mistagged events in the kaon category 
are wrong-sign D° mesons from B decays to double charm, wrong-sign kaons from D + decays, and 
kaons produced directly in B decays. All these sources produce a spectrum of charged tracks that 
have smaller than B decays that produce a correct tag. The a A t dependence originates from 

the 1/pt dependence of a z for the individual contributing tracks. 

9 At models for backgrounds 

Although the true At and resolution on At are not well-defined for background events, we still 
describe the total At model as a "physics model" convolved with a "resolution function". 

The background At physics models we use in this analysis are each a linear combination of one 
or more of the following terms, corresponding to prompt (zero lifetime), exponential lifetime, and 
oscillatory distributions: 



^ s (At true , 5 ) = ±6(At tine ).(l+g.(l-^)) 



g^ ys (At tIUC ,g) = exp(-|At truc |/r b s).(l + 9 .(l-^ ifc )), 



1 

6^(AW, 9) = exp(-| At true |/r b s) • (l + g ■ (1 - co osc ) cos Am h ^At tIUC ) , 



4 r bg 

where 6 (At) is a ^-function, g = +1 for unmixed and —1 for mixed events, and r bg and Am bg are 
the effective lifetime and mixing frequency for the particular background. 
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For backgrounds, we use a resolution function that is the sum of a narrow and a wide Gaussian 
distribution: 

K G+G (8At,a At ;b,s,f,b w ,s w ) 

= f ■ G(5At; ba At ,sa At ) + (!-/)■ G(SAt; b w a At , s w a At ) . 



9.1 Combinatoric background 

Events in which the D* candidate corresponds to a random combination of charged tracks (called 
combinatoric background) constitute the largest background in the signal sample. We use two 
sets of events to determine the appropriate parameterization of the At model for combinatoric 
background: events in data that are in the upper m(D*) — m(D°) sideband (above the peak 
due to real D* decays); and events in Monte Carlo simulation that are identified as combinatoric 
background, based on the true information for the event, in both the m(D*) — m(D°) sideband 
and peak region. The data and Monte Carlo At distributions are described well by a prompt plus 
oscillatory term convolved with a double-Gaussian resolution function: 



'comb 



/ osc • <?° s h c ys (At truc , <?;T comb , Am com V osc ) + (1 - / osc ) • 6j(At trac , g; ^ mt ) 
K G+G (6At,a At ;b,s,f,b w ,s w ) . 



(9) 



The parameters KJ pmt , Am comb , r comb , /, b w , and s w are shared among all control samples. The 
parameters u> osc , / osc , b, and s are allowed to be different depending on criteria such as tagging 
category, whether the data was recorded on- or off-resonance, whether the candidate lepton passes 
real- or fake-lepton criteria, whether the event passes the criteria for same-side or opposite-side D* 
and £, and how many identified leptons are in the event. The total number of free parameters in 
the combinatoric background At model is 24. 

The relative fraction of B°B° and B + B~ events in the combinatoric background depends 
slightly on m(D*) — m(D°). However, no significant dependence of the parameters of the At 
model on m(D*) — m(D°) is observed in data or Monte Carlo simulation. The sample of events 
in the m(D*) — m(D°) sideband is used to determine the starting values for the parameters in the 
final full fit to all data samples. 

To reduce the total number of free parameters in the fit, parameters that describe the shape 
of the wide Gaussian (bias and width) are shared between combinatoric background and the three 
types of peaking background: continuum, fake-lepton, and uncorrelated-lepton. The wide Gaussian 
fraction is allowed to be different for each type of background. 



9.2 Continuum peaking background 

All cc events that have a correctly reconstructed D* are defined as continuum peaking background, 
independent of whether the associated lepton candidate is a real lepton or a fake lepton. The cc 
Monte Carlo sample and off-resonance data are used to identify the appropriate At model and 
sharing of parameters among subsamples. The combinatoric-background At model and parameters 
described in the previous section are used to model the combinatoric-background contribution in 
the off-resonance At distribution in data. 

Events with a real D* from continuum cc production should have vanishing At in the case 
of perfect reconstruction. Therefore, we use the following model for the At distribution of these 
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events: 

Dependence on the flavor tagging information is included to accomodate any differences in the 
amount of background events classified as mixed and unmixed. 

By fitting to the data and Monte Carlo control samples with different sharing of parameters 
across subsets of the data, we find that the apparent "mistag fraction" for events in the kaon 
category is significantly different from the mistag fraction for other tagging categories. We also 
find that the core Gaussian bias is significantly different for opposite-side and same-side events. 
We introduce separate parameters to accommodate these effects. 

The total number of parameters used to describe the At distribution of continuum peaking 
background is six. The off-resonance control samples in data are used to determine starting values 
for the final full fit to all data samples. 



9.3 Fake-lepton peaking background 

To determine the At model and sharing of parameters for the fake-lepton peaking backgrounds, 
we use B°B° and B + B~ Monte Carlo events in which the D* is correctly reconstructed but the 
lepton candidate is misidentified. In addition, we use the fake-lepton control sample in data. 
The combinatoric and continuum peaking background At models and parameters described in the 
previous two sections are used to model their contribution to the fake-lepton At distribution in 
data. For this study, the contribution of signal is described by the signal parameters found for 
signal events in the Monte Carlo simulation. 

Since the fake-lepton peaking background is due to B decays in which the fake lepton and the 
D* candidate can originate from the same B or different B mesons, we include both prompt and 
oscillatory terms in the At model: 



giake 



/° SC • Sphys + (1 " /° SC ) • SpTys ® ^G+G(5At, CM' b, S, /, V 



We find that the apparent mistag rates for both the prompt and mixing terms, and the bias of the 
core Gaussian of the resolution function, are different between some tagging categories. The total 
number of parameters used to describe the fake-lepton background is 14. The fake-lepton control 
samples in data are used to determine starting values for the final full fit to all data samples. 



9.4 Uncorrelated-lepton peaking background 

To determine the At model and sharing of parameters for the uncorrelated-lepton peaking back- 
grounds, we use B°B° and B + B~ Monte Carlo events in which the D* is correctly reconstructed but 
the lepton candidate is from the other B in the event or from a secondary decay of the same B. In 
addition, we use the same-side control sample in data, which is only about 30% uncorrelated-lepton 
background in the m(D*) — m(D ) peak region due to significant contributions from combinatoric 
background and signal. The combinatoric and other peaking background At models and parame- 
ters described in the previous two sections are used to model their contribution to the same-side 
At distribution in data. For this initial study, the contribution of signal is described by the signal 
parameters found for signal events in the Monte Carlo simulation. 

Physics and vertex reconstruction considerations suggest several features of the At distribution 
for the uncorrelated-lepton sample. First, we expect the reconstructed At to be systematically 
smaller than the true At value since using a lepton and a D* from different B decays will generally 
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reduce the separation between the reconstructed -Bd*£ and B tag vertices. We also expect that events 
with small true At will have a higher probability of being misreconstructed as an uncorrelated lepton 
candidate because it is more likely that the fit of the D* and I to a common vertex will converge for 
these events. Finally, we expect truly mixed events to have a higher fraction of uncorrelated-lepton 
events because in mixed events the charge of the D* is opposite that of primary leptons on the 
tagging side. These expectations are confirmed in the Monte Carlo simulation. 

We do not expect the uncorrelated-lepton background to exhibit any mixing behavior and none 
is observed in the data or Monte Carlo control samples. We describe the At distribution with the 
sum of a lifetime term and a prompt term, convolved with a double-Gaussian resolution function: 

<? uncor = [/ Hfe • a^ C y s(AWe,<?;r uncor , W lifc ) + (1 - / lifc ) • eJ s (At trac , J; ^)] ® 

n G+G (SAt,a At ;b, S> f,b w ,s w ) . (10) 

The effective mistag rates co pmt and u; llfe accommodate different fractions of uncorrelated-lepton 
backgrounds in events classified as mixed and unmixed. We find that the apparent mistag rate for 
the lifetime term is different between some tagging categories. All other parameters are consistent 
among the different subsamples. The total number of parameters used to describe the uncorrelated- 
lepton background is six. The uncorrelated-lepton control samples in data are used to determine 
starting values for the final full fit to all data samples. 

9.5 Charged B peaking background 

The charged- B peaking background is due to decays of the type — > D*£viX. Since charged -B's 
do not exhibit mixing behavior, our strategy is to use the At and tagging information to discriminate 
charged- B peaking background events from neutral-S signal events, in the simultaneous fit to all 
samples. We use the same resolution model and parameters as for the neutral-S signal since the 
decay dynamics are very similar. The signal model, with the charged B background term, becomes 

Q si z(At, a At ,g; = ~ ^ + e -|Afeue|/r B o ( x + g{1 _ 2uJ c o) cos(Am(iAttrue )) + 

4t b o 

fB^ e -\At tIUC \/r B+ (1 + g{1 _ 2u c ^ 3 n{§A ^ aM . ^ ? 

where uj b0 {oj c b+ ) is the mistag fraction for neutral (charged) B mesons for tagging category c. 

Given that the ratio of the charged B to neutral B lifetime is close to 1 and the fraction of 
charged B mesons in the peaking sample is small, we do not have sufficient sensitivity to distinguish 
the lifetimes in the fit. We parameterize the physics model for the B + in terms of the lifetime ratio 
t b +/t b o, and fix this ratio to the Review of Particle Properties 2002 world average Q. We vary 
the ratio by the error on the world average to estimate the corresponding systematic uncertainties 
on t b o and Am^ (see Sec. 12). 



The fit is sensitive to only two parameters among u B +, lo b o and the charged B fraction (f B +). 
Therefore we fix the ratio of mistag rates, uj b +/uj b o, to the value of the ratio measured with fully 
reconstructed charged and neutral hadronic B decays in data, for each tagging category. 

10 Fit results 

The total number of free parameters in the final fit is 72: 22 in the signal model, 24 in the 
combinatoric background model, and 26 in peaking background models. The fitted signal At 
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model parameters are shown in Table ^. 



Table 2: Results of full fit to data — signal model and resolution function parameters. A small 



Signal Model and At Resolution Function Parameters 


parameter 


value 


parameter 


value 


parameter 


value 


Anid (ps -1 ) 


0.492 ±0.018 


/b+ 


0.082 ±0.029 


s 


1.201 ±0.063 


T B (ps) 


1 crno+0.024 
i - tJZO -0.Q23 


^lepton 


0.071 ±0.015 


K 


0.86 ±0.17 






offset 
kaon 


0.002 ± 0.024 


/lepton 


0.72 ±0.10 






^kaon 


0.229 ± 0.036 


/kaon 


0.609 ± 0.088 






WNT1 


0.212 ±0.020 


/nti 


0.69 ±0.13 






WNT2 


0.384 ±0.018 


/NT2 


0.70 ±0.10 






WNT3 


0.456 ± 0.012 


/NT3 


0.723 ± 0.078 






Awiepton 


-0.001 ±0.022 


yout 


0.0027 ±0.0017 






AcJkaon 


-0.024 ±0.015 


6 out (ps) 


-5.000 






A(JNT1 


-0.098 ± 0.032 


s out (ps) 


6.000 






Al<JnT2 


-0.112 ±0.028 










Al<JnT3 


-0.023 ±0.019 







The statistical correlation coefficient between r B o and Am^ is p(Amd,T B o) = —0.22. The 
global correlation coefficients for t b o and Am^, and some of the correlation coefficients between 
t b o or Amd and other parameters, are shown in Table [|. 

Table 3: Global correlation coefficients for Amj and t b o from the full fit to data and other corre- 
lation coefficients for pairs of key parameters in the fit. 



Am,; global correlation 


0.74 


t b o global correlation 


0.69 


p(Am d ,T B o) 


-0.22 


p(Am d ,f B +) 


0.58 




-0.49 




-0.26 



Figure |2| shows the At distributions for unmixed and mixed events in the signal sample (opposite- 
side D*-lepton candidates in on-resonance data). The points correspond to data. The curves corre- 
spond to the sum of the projections of the appropriate relative amounts of signal and background 
At models for the signal sample in the dm range between 143 and 148 MeV. Figure || shows the 
asymmetry 

_ Unmixed (At) — -/V mixe d(At) 
A r unm ixed(At) + iV m ixed(At) 

The unit amplitude for the cosine dependence of A is diluted by the mistag probability, the exper- 
imental At resolution, and backgrounds. 

Since we float many parameters in the model, it is interesting to see how the errors on t b o and 
Amd, and their correlation change when different parameters are free in the fit, or fixed to their 
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Figure 2: The At distribution for unmixed and mixed events in the signal sample (opposite-side 
-D*-lepton candidates in on-resonance data) and the projection of the fit results. The left-hand 
plots are for unmixed events; the right-hand plots for mixed events. The shaded area shows the 
background contribution to the distributions. 



best value from the full fit. We perform a series of fits, fixing all parameters at the values obtained 
from the default fit, except (a) Am<j and t b o, (b) Am^, t b o, and all mistag fractions in the signal 
model, (c) Amj, t b o, and f B +, (d) Am^, t b o, f B +, and all mistag fractions in the signal model, 
(e) all parameters in the signal At model. The one-sigma error ellipses for these fits and for the 
default fit are shown in Fig. ||. 

We can see that the error on t b o changes very little until we float the signal resolution function. 
Floating the background parameters adds a very small contribution to the error. The contribution 
from the charged B fraction and mistag fractions to the r B o error is negligible. On the other hand, 
the charged B fraction changes the error on Am<j the most. The contributions from floating the 
mistag fractions, resolution functions, and background At models are relatively small. 



11 Validation and cross checks 



In Sec. we describe several tests of the fitting procedure that were performed with both fast 
parameterized Monte Carlo simulations and full detector simulations. In Sec. 11.2, we give the 
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Figure 3: The asymmetry plot for mixed and unmixed events in the signal sample (opposite-side D*- 
lepton candidates in on-resonance data) in the 8m range from 143 to 148 MeV, and the projection 
of the fit results. Errors on the data points are computed by considering the binomial probabilities 
for observing different numbers of mixed and unmixed events while preserving the total number. 

results of performing cross-checks on data, including fitting to different subsamples of the data and 
fitting with variations to the standard fit. 

11.1 Tests of fitting procedure with Monte Carlo simulations 

A test of the fitting procedure is performed with fast parameterized Monte Carlo simulations, where 
87 experiments are generated with signal and background control sample sizes and compositions 
corresponding to that obtained from the full likelihood fit to data. The mistag rates and At 
distributions are generated according to the model used in the likelihood fit. The full fit is then 
performed on each of these experiments. We find no statistically significant bias in the average 
values of t b o and Am^ for the 87 fits. The RMS spread in the distribution of results is consistent 
with the mean statistical error from the fits and the statistical error on the results in data, for both 
t b o and Am^. We find that 17 of the fits to the 87 experiments result in a value of the negative 
log likelihood that is smaller (better) than that found in data. We also check the statistical errors 
on data by measuring the increase in negative log likelihood in data in the two-dimensional (t b o, 
Arrid) space in the vicinity of the minimum of the negative log likelihood. We found that the 
positive error on t b q is about 6% larger than that predicted by the fitting program, whereas the 
other errors are the same as predicted. We increased the positive statistical error on t b o by 6%. 

We also fit two types of Monte Carlo samples that include full detector simulation: pure signal 
and signal plus background. To check whether the selection criteria introduce any bias in the 
lifetime or mixing frequency, we fit the signal physics model to the true lifetime distribution, using 
true tagging information, for a large sample of signal Monte Carlo events that pass all selection 
criteria. We also fit the measured At distribution, using measured tagging information, with the 
complete signal At model described in Sec. ||. We find no statistically significant bias in the values 
of t b d or Arrid extracted in these fits. 
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Figure 4: Comparison of one-sigma error ellipses in the Am^-T^o plane for fits in which different 
sets of parameters are free. From the innermost to the outermost ellipse, the floating parameters 
are (Am^, r B o), (Am^, t b o, mistag fractions), (Am^, t b o, f B +), (Amj, t b o, f B +, mistag fractions), 
all signal At parameters, and the default fit (72 floating parameters). 

The B°B°, B + B~, and cc Monte Carlo samples that provide simulated background events 
along with signal events are much smaller than the pure signal Monte Carlo samples. In addition, 
they are not much larger than the data samples. In order to increase the statistical sensitivity to 
any bias introduced when the background samples are added to the fit, we compare the values of 
t b o and Arrid from the fit to signal plus background events, and pure signal events from the same 
sample. We find that when background is added, the value of t b o increases by (0.022±0.009) ps and 
the value of Amj increases by (0.020 ±0.005) ps -1 , where the error is the difference in quadrature 
between the statistical errors from the fit with and without background. We correct our final results 
in data for these biases, which are each roughly the same size as the statistical error on the results 
in data. We conservatively apply a systematic uncertainty on this bias equal to the full statistical 
error on the measured result in Monte Carlo simulation with background: ±0.018 ps for t b o and 
±0.012 ps" 1 for Am d . 

11.2 Cross-checks in data 

We perform the full maximum-likelihood fit on different subsets of the data and find no statistically 
significant difference in the results for different subsets. The fit is performed on datasets divided 
according to tagging category, 6-quark flavor of the D*~l + candidate, 6-quark flavor of the tagging 
B, and D° decay mode. We also vary the range of At over which we perform the fit (maximum 



26 



value of |At| equal to 10, 14, and 18 ps), and decrease the maximum allowed value of cr^t from 
1.8 ps to 1.4 ps. Again, we do not find statistically significant changes in the values of r B o or 
Am d . 

12 Systematic studies 

We estimate systematic uncertainties on the parameters t b o and Am<j with studies performed on 
both data and Monte Carlo samples, and obtain the results summarized in Table ^j. 

Table 4: Summary of systematic uncertainties on the two physics parameters, t b o and Amd- 



Source 


5(Am d ) (ps" 1 ) 


S(t b o) (ps) 


Selection and fit bias 


0.0123 


0.0178 


z scale 


0.0020 


0.0060 


PEP-II boost 


0.0005 


0.0015 


Beam spot position 


0.0010 


0.0050 


SVT alignment 


0.0030 


0.0056 


Background / signal prob. 


0.0029 


0.0032 


Background At models 


0.0012 


0.0063 


Fixed B + jB lifetime ratio 


0.0003 


0.0019 


Fixed B + /B° mistag ratio 


0.0001 


0.0003 


Fixed signal outlier shape 


0.0010 


0.0054 


Signal resolution model 


0.0009 


0.0034 


Total systematic error 


0.013 


0.022 



The largest source of systematic uncertainty on both parameters is the limited statistical preci- 
sion for determining the bias due to the fit procedure (in particular, the background modeling) with 
Monte Carlo events. We assign the statistical errors of a full fit to Monte Carlo samples including 
background to estimate this systematic uncertainty. See Sec. 11.1 for more details. 

The calculation of a decay-time difference At for each event assumes a nominal detector z- 
scale, PEP-II boost, vertical beam-spot position, and SVT internal alignment. We vary each of 
these assumptions and assign the variation in the fitted parameters as a corresponding systematic 
uncertainty. 

The modeling of the background contributions to the sample determines the probability we as- 
sign for each event to be due to signal and the At distribution we expect for background events. We 
estimate the systematic uncertainty due to the assumed background At distributions as the shift 
in the fitted parameters when we replace the model for the largest background (due to combinatoric 
events) with a pure lifetime model. We estimate the uncertainty due to the signal probability cal- 
culations by repeating the full fit using an ensemble of different signal and background parameters 
for the m(D*) — m(D 1 ) distributions, varied randomly according to the measured statistical un- 
certainties and correlations between the parameters. We assign the spread in each of the resulting 
fitted physics parameter as the systematic uncertainty. 

The model of the charged B background assumes fixed B + / B° ratios for the mistag rates and 
lifetimes. We vary the mistag ratio by the uncertainty determined from separate fits to hadronic 
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events. We vary the lifetime ratio by the statistical uncertainty on the world average H. The 
resulting change in the fitted physics parameters is assigned as a systematic uncertainty. 

The final category of systematic uncertainties is due to assumptions about the resolution model 
for signal events. We largely avoid assumptions by floating many parameters to describe the 
resolution simultaneously with the parameters of interest. However, two sources of systematic 
uncertainty remain: the shape of the outlier contribution, which cannot be determined from data 
alone, and the assumed parameterization of the resolution for non-outlier events. We study the 
sensitivity to the outlier shape by repeating the full fit with an ensemble of different shapes, 
and assign the spread of the resulting fitted values as a systematic uncertainty. We estimate the 
uncertainty due to the assumed resolution parameterization by repeating the full fit with a triple- 
Gaussian resolution model and assigning the shift in the fitted values as the uncertainty. 

The total systematic uncertainty on t b o is 0.022 ps and on Arrid is 0.013 ps -1 . 

13 Summary 

We use a sample of approximately 14,000 exclusively reconstructed B° —* D*~£ + v/> signal events 
to measure the B° lifetime t b o and oscillation frequency Am,; simultaneously, with an unbinned 
maximum-likelihood fit. The preliminary results are 

t b0 = (1.523±Hi ± 0.022) ps 

and 

Am d = (0.492 ± 0.018 ± 0.013) ps -1 . 

The statistical correlation coefficient between t b o and Am^ is —0.22. Both the lifetime and mixing 
frequency have combined statistical and systematic uncertainties that are comparable to those of the 
most precise previously-published experimental measurements ||. The results are consistent with 
the world average measurements of t b o = (1.542 ± 0.016) ps and Am^ = (0.489 ± 0.008) ps -1 ||. 
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